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Abstract 

We show that the finite satisfiability problem for the guarded two- 
variable fragment with counting quantifiers is in EXPTIME. The method 
employed also yields a simple proof of the result obtained in Kazakov 
that the satisfiability problem for the guarded two- variable fragment with 
counting quantifiers is in EXPTIME. 

1 Introduction 

The two-variable fragment with counting quantifiers, here denoted C 2 , is the set 
of function-free, first-order formulas containing at most two variables, but with 
the counting quantifiers 3<c, 3>c and 3—c (for every C > 0) allowed. The 
guarded two-variable fragment with counting quantifiers, here denoted QC 2 , is 
the subset of C 2 whose formulas contain no individual constants, and where all 
quantifiers appear only in guarded patterns (explained below). Both C 2 and QC 2 
are assumed to contain equality. Neither fragment has the finite model property, 
so that their respective satisfiability problems and finite satisfiability problems 
do not coincide. It was shown in Pratt-Hartmann that the satisfiability 
and finite satisfiability problems for C 2 are both in NEXPTIME. It was shown 
in Kazakov [5] that the satisfiability problem for QC 2 is in EXPTIME; however 
the method employed in that paper yields no information about the finite sat- 
isfiability problem for QC 2 . In this paper, we show that the finite satisfiability 
problem for QC 2 is in EXPTIME; furthermore, the method employed here also 
yields a simple proof of Kazakov's result. All complexity results mentioned in 
this paper assume succinct (binary) coding of numerical quantifier subscripts. 

The fragment QC 2 is a proper superset of the "description logic" ACCQI 
(Calvanese which in turn is a proper superset of the description logic ACC 
with general concept inclusion (essentially, multimodal K with universal quan- 
tification of formulas). This latter fragment has the finite model property, and 
its satisfiability problem (= finite satisfiability problem) is EXPTIME-hard. 
Hence, the complexity bounds reported here for QC 2 are tight. Furthermore, it 
was shown in Lutz et al. IHUHj that the finite satisfiability problem for ACCQI 
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is in EXPTIME. The present paper thus extends that result to the whole of 

gc 2 . 

By contrast, QC 2 is a proper subset of C 2 , whose satisfiability and finite 
satisfiability problems are NEXPTIME-hard. In fact, even the 2-variable frag- 
ment of first-order logic without counting quantifiers, which has the finite model 
property, exhibits a NEXPTIME-hard satisfiability problem, as can be shown 
by using its formulas to encode exponentially large grids (see, e.g. Borger et 
al. PQ, PP- 253 ff.). In such encodings, the quantifiers (V and 3) appear in un- 
guarded patterns. On the other hand, in the presence of counting quantifiers, 
a similar encoding is possible using only guarded quantification, provided that 
just one individual constant is admitted to the language. Thus, both the lack 
of individual constants and the restriction to guarded patterns of quantification 
are essential to the comparatively low complexity of QC 2 (assuming, of course, 
that EXPTIME ^ NEXPTIME). 

It was shown in Pratt-Hartmann [2], Corollary 1 that, if a formula <fi of 
C 2 is finitely satisfiable, then it has a model whose size is bounded by a doubly 
exponential function of the number of symbols in <p. This bound is optimal, even 
for the fragment QC 2 , in the sense that there exists a sequence {&} of finitely 
satisfiable formulas of QC 2 C C 2 whose size grows as a polynomial function of i, 
but whose smallest satisfying structures grow as a doubly exponential function of 

1 (Gradel et al. 0, p. 317). In view of this lower bound on the sizes of smallest 
satisfying structures, the upper complexity bound on the finite satisfiability 
problem for QC 2 proved here is noteworthy. 

The plan of this paper is as follows. Sectional defines the fragment QC 2 and 
establishes some basic results. Section describes a procedure for transforming 
any C/C 2 -formula into an (exponentially large) constraint satisfaction problem 
with variables ranging over N. Section 0] uses this transformation to prove that 
the finite satisfiability problem for QC 2 is in EXPTIME. Section [5] is dessert. 

2 Preliminaries 

We restrict consideration to first-order languages whose only primitive symbols 
are the variables x and y, the usual Boolean connectives, the quantifiers V, 3, 
3<c, 3>c 3 = c (for all C > 0), the equality predicate (here written as «) 
and a non-logical signature of 0-ary, unary and binary predicates. There are 
no individual constants or function-symbols in these languages. If p is any 
binary predicate (including «), we call an atomic formula having either of the 
forms p(x, y) or p(y, x) a guard-atom. The two-variable guarded fragment with 
counting quantifiers, QC 2 , can then be defined as the smallest set of formulas 
satisfying the following conditions: 

1. QC 2 contains all atomic formulas and is closed under Boolean combina- 
tions; 

2. if <j> is a formula of QC 2 with at most one free variable, and u is a variable 
(i.e. either x or y), then the formulas Vit</f> and 3u(f) are in QC 2 ; 
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3. if is a formula of QC 2 , 7 a guard-atom, u a variable, and Q any of the 
quantifiers 3, 3<c, 3>c, 3 = c (for C > 0), then the formulas Vu(7 — » 0) 
and Qu(j A 0) are in QC 2 . 

We take the semantics to be the usual semantics of first-order logic, with count- 
ing quantifiers interpreted in the obvious way. If <ft is any formula of QC 2 , we 
denote the number of symbols in by \\4>\\, assuming binary coding of numerical 
quantifier subscripts. 

According to the above syntax, the non-counting quantifiers 3 and V may 
apply without restriction to formulas with at most one free variable; however, 
they may apply to formulas with two free variables only in the presence of a 
guard-atom. By contrast, the counting quantifiers 3<c, 3>c, 3 = c may only 
every apply in the presence of a guard atom (which by definition has two free 
variables). Note in particular that the formula 3 = ixp(x) is not in QC 2 . In fact, 
the next lemma shows that no formula of QC 2 can force a predicate p to be 
uniquely instantiated in its models. 

Definition 1. Let 21, 25 be structures over disjoint domains A, B, respectively, 

interpreting a common signature £ with no individual constants. The union 

21 U S of 21 and 05 is the structure with domain A U B and interpretations 
^aus =ff a Uff « for every a e £. 

Lemma 1. Let <f> be a formula of QC 2 and 21 a structure over the signature of 
<p. For N > 1, let N ■ 21 denote the union of N disjoint copies of 21. If 4> is 
satisfied in 21, then it is satisfied in N ■ 21. 

Proof. If 9 : {x, y} — > A is any variable assignment over A and 1 < i < N, let 8 l 
bo the variable assignment over N ■ 21 which maps x and y to the corresponding 
elements in the ith. copy of 21. A routine structural induction on <f> shows that 
21 he 4> if and only if, for some (= for all) i (1 < i < N), N ■ 21 \= ei (f>. □ 

It follows immediately from Lemma ^ that, if a formula of QC 2 has a finite 
model, then it has arbitrarily large finite models, and indeed infinite models. 

The following lemma, which is a cosmetic modification of Kazakov [H], 
Lemma 2, establishes a normal form for (JC 2 -formulas. 

Lemma 2. Let %p be a formula in QC 2 . We can construct, in time bounded by 
a polynomial function of a formula 

<fi := V.TCK A f\ Vx\/y(e h (x,y) -> ((3 h Vi w y))A 

l<h<l 

/\ Vx3= Ci y{fi{x,y) Axjiy) 

l<i<m 

such that: (i) a is a quantifier-free formula not involving w with x as its only 
variable; (ii) I and m are positive integers; (Hi) for all h (1 < h < I) , eh is a 
binary predicate other than and [3h is a quantifier-free formula not involving 
« with x and y as its only variables; (iv) for all i (I < i < m), Ci is a positive 
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integer, and /, is a binary predicate other than «; (v) <f> is satisfiable if and 
only if i\) is satisfiable; (vi) <p is finitely satisfiable if and only if -0 is finitely 
satisfiable. 

Proof. Standard transformation to Scott normal form, using Lemma ^ Sec 
Kazakov op. cit. for details. □ 

Hence, to show that the (finite) satisfiability problem for QC 2 is in EXP- 
TIME, it suffices to consider only formulas of the form QJ. Furthermore, we 
may assume without loss of generality that no 0-ary predicates (proposition let- 
ters) occur in 0, since we can consider each of the (at most 2^11) truth- value 
assignments to the 0-ary predicates of <p in turn, replacing each 0-ary predicate 
with T or 1 according to its truth- value in the considered assignment. 

Accordingly, fix (f> to be some formula of the form Q over a signature of 
unary and binary predicates. Set C = maxi<i< m Cj, and let £ be the signature 
of <f> together with log((mC) 2 + 1) (rounded up) new unary predicates. Thus, 
|S| is bounded by a polynomial (actually, linear) function of ||</>||. Since £ is the 
only signature we shall be concerned with in the sequel, we generally suppress 
reference to it. Thus, 'predicate' henceforth means 'predicate in £ U {«}', 
'structure' henceforth means 'structure interpreting £', and so on. We keep the 
meanings of the symbols a, /, m, e/j, (3^, (1 < h < /), Ci, /j (1 < i < I), cj>, C, 
£ fixed throughout this paper. The predicates /i, . . . , f m will play a key role 
in the ensuing argument; wc refer to them as the counting predicates. There is 
no restriction on these predicates' occurring in other parts of 4>: in particular, 
they may feature as guards. 

We review some standard concepts. A literal is an atomic formula or the 
negation of an atomic formula. A 1-type is a maximal consistent set of equality- 
free literals involving only the variable x. A 2- type is a maximal consistent set of 
equality-free literals involving only the variables x and y. If r is a 2-type, then 
the result of transposing the variables x and y in r will also be a 2-type, denoted 
r . If 21 is any structure, and a £ A, then there exists a unique 1-type ir(x) 
such that 21 |= it [a]; we denote ir by tp a [a]. If, in addition, b € A is distinct from 
a, then there exists a unique 2-type r(x,y) such that 21 \= r[a, b]; wc denote r 
by tp a [a, b]. We do not define tp a [a, b] if a = b. 

Notation 1. Any 2-type r includes a unique 1-type, denoted tp x (r); in addi- 
tion, we write tp 2 (r) for tp 1 (r _1 ). 

Remark 1. Let % be a structure, and let a, b be distinct elements of A. If 
tp a [a,&] =r, then tp 21 [b, a] =r _1 , tp a [a] =tp 1 (r), andtp m [b] =tp 2 (r). 

Definition 2. Let r be a 2-typc. We say that r is a message-type if, for some 
counting predicate /, (1 < i < m), fi{x,y) G r. If r is a message-type such 
that t -1 is also a message-type, we say that r is invertible; otherwise, r is non- 
invertible. Finally, if r is a 2-type such that neither r nor t _1 is a message- type, 
we say that r is silent. 
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Remark 2. Let 21 be a structure, and let a, b be distinct elements of A. Then 
tp a [a, b] is a message-type just in case 21 |= fi[a,b] for some i (1 < i < to); 
if so, then this message-type is invertible just in case 21 |= f# [b, a] for some i' 
(1 < i' < to). 

The terminology is meant to suggest the following imagery. If tp a [a,6] is a 
message-type \i, then we may imagine that a sends a message (of type /z) to b. 
If ji is invertible, then b replies by sending a message (of type /i _1 ) back to a. 
If tp a [a,6] is silent, then neither element sends a message to the other. 

Definition 3. Let 21 be a structure. We say that 21 is chromatic if distinct 
elements connected by a chain of 1 or 2 invertible message-types have distinct 
1-types. That is, 21 is chromatic just in case, for all a, a' , a" G A: 

1. if a ^ a' and tp a [a, a'] is an invertible message- type, then tp a [a] ^ tp a [a']; 
and 

2. if a, a', a" are pairwisc distinct and both tp a [a,a'] and tp a [a',a"] are in- 
vertible message- types, then tp a [a] ^ tp a [a"]. 

Lemma 3. If <j) has a model, then it has a chromatic model over the same 
domain. 

Proof. Suppose 21 |= (j>, and consider the (undirected) graph G on A whose 
edges are the pairs of distinct elements connected by a chain of 1 or 2 invertible 
message -types. That is, G = (A, E 1 U E 2 ), where 

E 1 ={(a, a') \a ^ a' and tp a [a, a'] is an invertible message-type} 
E 2 ={(a, a") | a 7^ a" and for some a 1 € A, (a, a') and (a 1 , a") are 
both in E 1 }. 

Since C = maxi<i< m Ci, the degree of G (in the normal graph-theoretic sense) 
is at most (mC) 2 . Now use the standard (greedy) algorithm to colour the nodes 
of G with (toC) 2 + 1 colours in such a way that no edge joins two nodes of the 
same colour. By interpreting the log((TOC) 2 + 1) (rounded up) unary predicates 
of E not occurring in (f> to encode these colours, we obtain the desired chromatic 
model. □ 

In the sequel, we shall need to refer to sets of invertible message-types in- 
dexed by bit-strings as follows. Let the 1-types be enumerated as 

II = 7T , . . . , 7Tp_!. 

Evidently, P is a power of 2, so p = logP is an integer. (Actually, p = |E|.) 
Now let s be any bit string (0 < \s\ < p), and denote the string of length by e. 
We inductively define the sub-sequence II S of II by setting II e to be the whole 
of II, and setting II s o and ILi to be the left and right halves of IT, respectively. 
Formally: 



IT = 7TQ, . . . ,7Tp_i, 
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and if II S = Wj, . . . , TTk-i, with |s| < p, 

n s o = ^,...,^^-3 

2 

ILji = TTk-j , -, 7Tfe_-|. 

Thus, if |s| = p, then ILj is a one-element sequence 7Tj, where j is the integer 
(0 < j < P) encoded by the bit-string s in the usual way. To avoid clumsy 
circumlocutions, we occasionally equivocate between bit-strings and the integers 
they encode, thus, for example, writing 7r s instead of Ttj in this case. But we 
will only ever write tt s if \s\ = p. 

We may use the sets IT to define sets of invcrtiblc message-types indexed by 
bit-strings as follows. Fix any 1-type n, and denote by the set of invertible 
message-types A such that tp 1 (A) = 7r. If s is any bit-string such that \s\ < p, 
let 

A*,, = {XeA 7r | tp 2 (A) e n s }. 

Thus, the A„. iS are sets of invertible message-types identified purely by their 
terminal 1-types. Except in very special cases, these sets will contain more than 
one member, even when \s\ = p. However, for chromatic models, we have the 
following important fact. 

Lemma 4. // 21 is chromatic, it = tp a [a] , and s is a bit-string with \s\ = p, 
then there can be at most one element b G A \ {a} such that tp a [a, b] £ A^, s . 

Proof. Any two such elements would be connected by a chain of two invertible 
message- types, and would both have the 1-type 7r s . □ 

Finally, we use bit strings to index other sets of 2-types as follows. Again, 
fix any 1-type 7r, and consider the set of non-invertible message-types ji such 
that tp 1 (^i) — tt. Let these be enumerated in some way as a sequence 

fJ-ufi, ■ ■ ■ j [J-irM-1- 

Furthermore, consider the set of silent 2-types y, such that tp 1 (/x) = tt. Let 
these be enumerated in some way as a sequence 

Thus, the sequence 

M n = /-Vo> • • • i Mtt.q-i- 

is a list of precisely those 2-types r such that tp 1 (r) = tt and r _1 is not a 
message-type. Evidently, R and Q are independent of the choice of tt; moreover, 
Q is a power of 2, so q = log Q is an integer. (We remark that R is not a power of 
2.) Let t be any bit string (0 < |<| < q). We inductively define the sub-sequence 
M^.t of by setting M Wj£ to be the whole of M w , and setting M n>m and M Wjtl 
to be the left and right halves of M w t , respectively. Formally: 



M-K,e — Mt,0) • • • ) Mw,<2-1) 
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and if M„ )t = (i>n,j, ■ ■ ■ ,fJ>n,k-i, with \t\ < q, 

Thus, if |t| = g, then M w t is a one-element sequence where j is the integer 
(0 < j < Q) encoded by the bit-string t in the usual way. Again we may for 
convenience write //^t instead of in this case, but here too we only ever 
write /i ffjt if \t\ = q. 



3 Transformation into an integer constraint 
problem 

Henceforth, vector means "m-dimensional vector over N" . If u and v are vectors, 
we write u < v if every component of u is less than or equal to the corresponding 
component of v; we write u < v if u < v and u =/= v. Similarly for > and >. 
The number of vectors u such that u < C is bounded by (C + l) m , and hence 
by an exponential function of \\cj>\\. 

Referring to the formula QJ, denote the vector (Ci, . . . , C m ) by C and the 
vector (0, . . . , 0) by 0. Moreover, given any 2-type r, we write C r for the vector 
(C Ti i, . . . , C T .m) where, for alii (1 < i < m), 

I otherwise. 

Note that, if r is not a message- type — in particular, if r is a silent 2-type — we 
have C T = 0. 

Let r be any 2-type. Since t is a finite set of formulas with free variables 
x and y, we may write f\ t to denote their conjunction. Referring again to the 
formula l[T]l. we say that t is forbidden, if the formula 

a(x)Aa(y)A /\ {e h {x, y) -»• h ) A /\ r (3) 
l<h<l 

is unsatisfiable. Thus, if 2t |= and a, b are distinct elements of A, then tp a [a, b] 
cannot be forbidden. Since © is purely Boolean, we can evidently identify the 
forbidden 2- types in time bounded by an exponential function of \\(f>\\- 

In the sequel, we take ir to vary over the set of 1-types, A to vary over the 
set of invertible message-types, s to vary over the set of bit-strings of length at 
most p, t to vary over the set of bit-strings of length at most q, and u, v and 
w to vary over the set of vectors < C. (Similarly for their primed counterparts 
7r', A', s' , t', u', v' and w'.) We refer to these sets as the standard ranges of the 
respective letters. Occasionally, additional restrictions on these ranges will be 
imposed. 
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Now let V be the set whose elements are the following (distinct) symbols, 
where the indices A, 7r, s, t, u, v, w vary over their standard ranges: 

y-w,s,v,w whenever \s\ < p, Zn,t,v,w whenever |t| < q. 

The symbols j/7r,s,v,w and £ w ^ iV ,w are not defined when \s\ = p and \t\ = q. The 
cardinality of V is evidently bounded by an exponential function of \\(f>\\. We 
impose some arbitrary order on V, and refer to its elements as variables. If U 
is a non-empty set of variables, enumerated, in order, as {ui, . . . , Ufe}, let U 
denote the term u\ + - ■ -+Uk\ if U is the empty set, let U denote the (constant) 
term 0. In the sequel, we take a constraint to be an equation or inequality 
involving arithmetical terms over V , or a conditional statement formed from 
two such inequalities. A solution of a set of constraints over some numerical 
domain D is simply a function 9 : V — ► D under which all the constraints in 
question evaluate (in the obvious way) to true. 

With this notation, let 8\ be the following set of constraints involving the 
variables V , where 7r, u, v, w again vary over their standard ranges, and s, t 
vary over bit-strings such that \s\ < p and \t\ < q: 





— y7r,e,C-u 




(4) 


y-K,s,u 


— ^ ^{ j/7r,s,v',w' 


■ | v' + w' = u} 


(5) 






| v' + w' = u} 


(6) 


y-7T,sO,V 




| v + w' < C} 


(7) 


2/7T,sl,W 




| v' + w < C} 


(S) 




^ ^{^TT^.V.W 7 


| v + w' < C} 


(9) 


<2"7T,£1,W 




| v' + w < C} 


(10) 


1 


- XX^vx i 


tt' a 1-typc, u' < C}. 


(11) 



Let £2 consist of the following constraints, where A, 7r vary over their stan- 
dard ranges, s, t vary over bit-strings such that \s\ = p, \t\ = q, and u varies 
over vectors such that < u < C: 



U 7T , S , U 




V\i A' <E Ajr.s and C,y = u} 


(12) 


•^7r,t,u 


= 


whenever u is not a scalar multiple of 
C T for r = 


(13) 




= ^A 




(14) 


•'A 


= 


whenever tp-^A) = tp 2 (A) 


(15) 


•''A 


= 


whenever A is forbidden 


(16) 




= 


whenever [1^,1 is forbidden. 


(17) 
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Note that, in (|13fl . t = ^ Tit is a 2-type, and the vector C r is defined according 
to (J2J. If the integer encoded by t is less than R, r = will be a (non- 
invertible) message-type, and we will have C T > 0. If, on the other hand, the 
integer encoded by t is greater than or equal to R, t will be a silent 2-type, and 
we will have C r = 0. In this latter case, no vector u such that u > can be a 
multiple of C T , whence, £2 contains the constraint z^,t,u = for all u such that 
< u < C . 

Let £3 consist of the following constraints, where n varies over all 1-types, 
t varies over bit-strings such that \t\ = q, and u varies over vectors such that 
< u < C: 

Zn,t,u>0 => ^2{y7T',e, u ' I 7r' = tp 2 (^ W)t ) and u' < C} > ZmC. (18) 

Again, if the integer encoded by t is greater than or equal to i?, we have already 
argued that £2 contains the constraint, z„ t t,n — f° r all u such that < u < C, 
rendering the corresponding instances of (|18fl trivial. 

Finally, let £ = £y U £ 2 U£ 3 . 

4 Main result 

Lemma 5. Let <f> and £ be as above. If <f> is finitely satisfiable, then £ has a 
solution over N. 

Proof. Suppose (f> is finitely satisfiable. By Lemma|31 let 21' be a finite, chromatic 
model of cj), and by Lemma^ let 21 = 3mC • 21'. Thus, 21 is also chromatic. Let 
21 have domain A. If it is a 1-type, let A^ = {a G A \ tp a [a] = it}. Now suppose 
a G An- For any bit-string s (0 < |s| < p), define the s- spectrum of a, denoted 
spf [a], to be the vector whose ith component (1 < i < m) is given by 

\{b£A:%\=fi[a,b},b^a and tp a [a, 6] G K,s}\- 

For any bit-string t (0 < \t\ < q), define the t-tally of a, denoted tlf [a], to be 
the vector whose ith component (1 < i < m) is given by 

|{6 G A : 21 h /<[a,6], & ^ a and tp a [a,6] G M^}|. 

These quantities are easiest to understand when the strings s and £ take the 
value e. For any a G A, sp^[a] is the vector whose ith component records the 
number of elements b to which a sends a message of invertible type containing 
the atom fi(x 7 y). Likewise, tl e [a] is the vector whose zth component records 
the number of elements b to which a sends a message of non- invertible type 
containing the atom fi(x,y). If tp 21 ^] = it and < \s\ < p, then spj[a] is 
obtained in the same way as sp;f [a] , except that we discount all messages whose 
type is not a member of A^g. Likewise, if tp a [a] = ir and < \t\ < q, then 
tlf [a] is obtained in the same way as tlf [a] , except that we discount all messages 
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whose type is not a member of M Wi {. It is then easy to see that, for all nei, 
all bit-strings s (\s\ < p) and all bit-strings t (\t\ < q), 



Let 9 : V -> N be defined by: 



0{xx) 


= K« 


£ A : 


there exists 6 G A \ {a} such that tp a [a, b] = 


A}| 




- K« 




:spf[a]=u}| 




0(Z7T,t,u) 


= l{« 


G A„- 


:tl?[a] = u}| 




^(y7T,S,V,w) 


= \{a 


G A w 


: s P?oH = v an d s Psi[ a ] = w l whenever s 


<P 


^(^7r,t,v,w) 


= \{a 


G A^ 


: tl^[a] = v and tlfja] = w} whenever \t\ 


< g, 



where the indices A, 7r, s, i, u, v and w vary over their standard ranges. 

We show that 9 is a solution of £ by checking the forms (|^jl- H18|l in turn. 
The constraints follow easily from Equations H19|) - (|21J) . respectively. 

The constraints JjJ-Jinjl are immediate. In the (single) constraint H 1 If) - the 
sum on the right-hand side evaluates under 9 to the cardinality of A, which is 
positive by definition. To understand the constraints i|12|) , fix any ir and s (with 
|s| = p), and write, for any A € A OT)S , 

A\ = {a E A | there exists b G A \ {a} such that tp a [a, b] = A}. 

Thus, A\ C Ayr, and = 9(x\). By Lemma 0J for any a g there can 
be at most one element b G A \ {a} such that tp a [a,6] G A x s . It follows that 
the sets A\, for A varying over A ffiS , are pairwise disjoint, and, moreover, that 
a G has s-spectrum u > if any only if it is a member of some (hence, 
exactly one) of these sets A\, with C\ = u. That is, for all u > 0, 



with the sets on the right-hand side pairwise disjoint. The relevant instance 
of 112|) is then immediate from the definition of 9. To see why the con- 
straints Ijl 3(1 hold, note that, if \t\ = q, then M Tji is a set containing precisely one 
2-type t = /U^, which is either a non-invcrtible message- type or a silent 2-typc. 
Either way, for every a G A^ , tlf [a] must be a scalar multiple (possibly zero) of 
C r . In other words, if u > is not a scalar multiple of C T , then 9(z w ,t,u) = 0. 
To understand the constraints ((14(1 . observe that, since 21 is chromatic, 9(x\) 
is actually the total number of messages of (invertiblc) type A sent by elements 
of 21, and similarly for 9(x(\-i)); and these numbers are obviously equal. The 
constraints 1(15(1 are immediate given that 21 is chromatic. The constraints 1(16(1 
and ((17(1 are immediate given that 21 1= 4>. To understand the constraints 1(18(1. 
fix any n, t and u such that \t\ = q and u > 0. If z IitiU > holds under 9, then 
/Lt OTj t is a (non-invertible) message-type; and furthermore, at least one message 



spf[a]+tlf[a] 

spfoH+spfiW 
tlfoH+tlfja] 




c 



(19) 
(20) 
(21) 



{a G A„ | bp? [a] = u} = |J{Av | A' G A„ and C 



A' = U}, 
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of that type must be sent in 21, so that 21 contains at least one element whose 
1-type is tp 2 (//*•,*) and hence — since 21 = 3mC-2t' — at least 3mC such elements. 
But the exact number of elements in A whose 1-type is tp 2 (/x w t ) is given by the 
value, under 8, of XX2/wV,u' I 7r ' = tP2(A t ir,t) an( l u' < C}. □ 

In establishing the converse of Lemma the following technical result con- 
cerning solutions of £ will prove useful. To avoid notational clutter, we use the 
variable names x\, y-w.s,u, z„ t t,u> Vtt.s.v.w, z-x,t,v,w to stand for the corresponding 
natural numbers in some such solution (and similarly for terms involving these 
variables). 

Lemma 6. Let x\, yn. s ,u, z^.t.-u, 2Ar.s,v.w, z^^.v.-w {with indices having the 
appropriate ranges) be natural numbers satisfying the constraints £ given above. 
Fix any 1-type ix , and let A n be a set of cardinality ^2{yn,e,u> | u' < C}. Then 
there exists a system of functions on A v 

U, s ■ A n -> {u | u < C} g n>t : A v {u | u < C}, 

where the indices s and t vary over their standard ranges, such that, for all 
vectors u < C, 

|/"j(u)| = y^ u (22) 
|. 9 -j(u)| = z v , t , u , (23) 

and such that, for all a G A^ , 

Y,{f.,Aa):\ S '\=p} + Y,{9«,t'(a)--\t'\=q} = C. (24) 



Proof. Decompose the set A n into pairwise disjoint (possibly empty) sets A u 
such that \A U \ = y-^.^u, where the index u varies over all vectors < C. This is 
possible by the cardinality of A n . For all u < C, and all a € A u , set 

f v>e (a) = u g^ e (a) = C - u. 

This assignment evidently satisfies (1221) for s = e; and by the constraints 10} , it 
also satisfies <|2"5|) for t = e. We observe in passing that, for all aei„, 

A,«(o)+fl Wie (o) = C. (25) 

We now construct the functions f v>s , where < \s\ < p, by induction on s. 
Assume that, for some s (0 < |s| < p), f vs has been defined and satisfies (|22|) . 
For every vector u < C, decompose f~l(u) into pairwise disjoint (possibly 
empty) sets A v vv such that |A V W | = y^s.v.w, where the indices v, w vary over 
all vectors satisfying v + w = u. This is possible by the constraints JSJ) together 
with the assumption that /^ iS satisfies <|22[) . Having thus decomposed the sets 
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f v l(u) (for all u < C), we see that, for any a G A n , there is precisely one 
(ordered) pair of vectors v, w such that a € A v vf ; hence we may set 

U,so(a) = v U,ai(a) = w. 

This defines the functions fir.so and f n ,si- We observe in passing that, for all 
a G A n , 

U,so(a) + U,al(a) = U,s(a)- (26) 

To see that f VlS o and f v , s i both satisfy Equation note that f-K, s o{a) = v 
if and only if, for some vector w' such that v + w' < C, a G A v w /. Similarly, 
f-K,si(a) = w if and only if, for some vector v' such that v' + w < C, a G A V / )W . 
That is, 

/^o(v) = U^v.w |v + w'<C} 

i7ii(w) = Ui^,w |V + w<c}, 

with the collections of sets on the respective right-hand sides being pairwise 
disjoint. By the constraints 101-©, together with the fact that |-A V , W | = 2/7r,s.v,w 
for all v, w, we have: 

I.Cjo( v )l = 2/*-, s o,v 

l/^sl( w )l = 2Ar, S l,w, 

which establishes (|22|l for the functions /^so and f v s \- This completes the in- 
duction. The construction of the functions g^j proceeds completely analogously, 
using Constraints JJjJ, © and (|1C)|> . In carrying out this latter construction, we 
obtain, in a parallel way to l|26|l . 

ffir,to(a) + SW,ti(a) = ^(a), (27) 

for all bit-strings i (|t| < q) and all a G A n . 

It remains to establish Ij24(l . We prove the stronger result that, for all a G A^, 
j (0 < j < p) and fc (0 < k < q), 

^{/^(a):| S '|=i} + ^W(a):|t'| = fc} = C, (28) 

using a double induction on j and k. If j = k = 0, then the left-hand side 
of (|2*8)l is simply /^(a) + .9ir,e(a), which is equal to C by J2HJ). Suppose now 
that the result holds for the pair j, k, with j < p. Then 

J2{ua*) ■■ w\ = u + 1)} + : = fc > 

= ]T{A, S 'o(«) + f*,M<*) : \ s '\ = ft + X){fl*,t'(a) : = ^} 

= : i s 'i = 3} + E^-.*'( a ) : = fc > b y 09 

= C by inductive hypothesis. 

This establishes the result for the pair j+l,k. An analogous argument using l|27[) 
applies when k < m, completing the induction. □ 
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Before we come to the promised converse of Lemma \5\ we remark on the 
(exponentially many) choices made during the construction of the various func- 
tions f Wt3 and g Vjt m the proof of Lemma [5] — specifically, in the decomposition 
of certain sets into collections of subsets. It is because of this large number 
of independent choices that solutions of £ typically encode not one, but many, 
models of <j>. 

Lemma 7. Let 4> and £ be as above. If £ has a solution over N, then (f> is 
finitely satisfiable. 

Proof. Suppose £ has a solution over N. Again, we use the variable names x\, 
2/tt,s,u, ^7r,t,uj Z/tt.s.v.w, ^7r,t,v,w to stand for the corresponding values in some 
such solution. Our task is to construct a model 21 of cf). 

For each 1-type ir, let A„ be a set of cardinality ^2{yn,e,u | u < C}, with 
the An pairwise disjoint; and let A = ^{A^ \ tt a 1-type}. Think of A n as the 
set of elements of A which 'want' to have 1-type tt. By the constraint (|ll(l . 
A ^ 0. For every 1-type tt, let the functions f^. s and g^.t be constructed as 
in Lemma we are interested only in those /„■ s and g^ t t where \s\ = p, and 
|t| = q. For all such tt, s, t, and all a G A„, think of /„• a (a) as the s-spectrum 
which a 'wants' to have, and think of g^^ifi) as the t-tally which a 'wants' to 
have. Finally, consider any set /~*(u), where < u < C and \s\ — p. Using 
the constraints lfT^|l and Equation (J22J, we can decompose f~l(u) into pairwise 
disjoint (possibly empty) sets A\ with \A\\ = x\, where A varies over the set 
of invertible message-types such that A G A WjS and Ca = u. It follows that, if 
a G A\, with A G A,r )S , then Ca = f-K.sia)- Think of A\ as the set of elements 
of A^ which 'want' to send a single message of (invertible) type A. 

Before proceeding, we pause to consider the construction just described in 
respect of any of the sets A^. Fixing, for the moment, some bit-string s with 
[s| = p, we see that A^ is decomposed into the pairwise disjoint sets /~^(u) 
(as u varies over vectors such that u < C), and that each of the sets f~l(u), 
where < u < C, is further decomposed into the pairwise disjoint subsets 
A\ (as A varies over the elements of A IiS such that C\ = u). Note that the 
set f~l(0) is not subject to this further stage of decomposition. This process 
is performed for every bit string s with \s\ = p, so that different values of s 
lead to independent — and possibly overlapping — decompositions, as illustrated 
in Fig. H Likewise, for every bit-string t with \t\ = </, A^ is decomposed into 
the pairwise disjoint sets g~)(u) (as u varies over vectors such that u < C). 
Again, decompositions corresponding to different values of t should be thought 
of as independent of each other. 

We now proceed to construct, for every a G j4, a data-structure recording a 
collection of messages sent by a, each of which is labelled with some (invertible 
or non-invertible) message-type. (These data-structures will later be combined 
to form a structure 21 on A) Recall that, if tt is any 1-type, then p-„,o, ■ ■ ■ , P-n,R-i 
is an enumeration of the non-invertible message-types p such that tp x (/i) = tt. 
Fix a G A, and let tt be the unique 1-type such that a G A„. The messages 
sent by a shall be as follows, (i) For every bit-string s such that \s\ = p, if 
f-K,s{o) > Oj let X a , s be the invertible message-type A G A w s such that a G A\ 
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/^l(u) 



Ax, where A 6 A^,. 
and Ca = u 



!/ _1 /(u') 



"-A^i, where A' 6 A^y 
and C X i ~ u' 



/ _1 '(0) 



Figure 1: The decompositions of A^ for the strings s and s' . 



(hence Ca = fn,s(ci)), and let a send a single message labelled A ajS . Note that, 
if f-n,s{a) > 0, then A QiS exists and is unique by the construction of the sets A\. 
(ii) For every bit string t such that \t\ = q, if t encodes an integer less than R 
(so that fi = fiirj is a non-invcrtiblc message-type and C M > 0), let n a ,t be the 
unique natural number n such that g^^ifl) = nC^ 7 and let a send n a t distinct 
messages labelled /i^.t. Note that, if g 7Vt t( a ) = 0; then n a t = 0; on the other 
hand, if g n ^{a) > 0, then n ai t exists by the constraints (|13f) and Equation (|23|) . 
The resulting data-structure is depicted in Fig. [21 where, for readability, we 
have replaced any bit-strings by the integers they conventionally denote. For 
all a £ A and all i (1 < i < m), let G a ,i (1 < i < ?ti) be the number of messages 
sent by a having any label v for which fi(x,y) £ i/, and furthermore let C a be 
the vector (C a ,i, ■ ■ ■ , C a;m ). By inspection of Fig. 

c a = x;{a,-'(o) : = pi + £w(°) : = 

and so, by Equation 

C a = C. (29) 
We now build 21 in four steps as follows. 

Step 1 (Fixing the 1-types): For all 1-types w and all a £ A n , set tp [a] = tt. 
Since the ^ are pairwise disjoint, no clashes arise. 

Step 2 (Fixing the invertible message- types): Let A be any invertible message- 
type. By construction, exactly \A\\ = x\ elements of A send some message 
labelled with A, and each of those elements sends exactly one such message. 
Hence, the number of messages labelled with A (over all a £ A) is x\; likewise, 
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f 




n a ,o 



Figure 2: The messages sent by a G A^. For each j (0 < j < P), a may or 
may not send a message labelled X a .j (hence the dotted lines); if it does, then 
X a .j G A-ir.j- For each k (0 < k < R), a sends n a ^ messages labelled ^,k] but 
the numbers n a ,k can be zero. 

the number of messages labelled with A" 1 is x x -i- By the constraints l|14|l . we 
may put the A-labclled messages and the A~ ^labelled messages in 1-1 corre- 
spondence. If a G A sends a A-labelled message, let b G A send the corresponding 
A _1 -labclled message, and set tp [a, b] = A. For this assignment to make sense, 
we need to check that a and b are distinct. But, by construction, we must have 
x\ > 0, whence, by the constraints l(T3|) . tp 1 (A) ^ tp 2 (A), so that A Pl (A) an d 
^4 t p 2 (^) are disjoint sets containing a and 6, respectively. Thus, the assignment 
tp 21 ^, b] = A makes sense, and does not clash the with 1-typc assignments in 
Step 1. We can think of the element b as 'receiving' the message sent by a (and 
vice versa). Moreover, by construction, for every 1-type 7r', o sends at most 
one message labelled with an invertible message- type A' such that tp 2 (A') = tt'. 
Therefore, there is no chance that these assignments clash with each other. 

Step 3 (Fixing the non-invcrtible message- types): As a preliminary, for every 1- 
type 7r, we decompose A^ into three pairwise disjoint (possibly empty) sets A Vj q, 
A Ki i and A-^^ satisfying the condition that, if A^ \ > 3mC, then \ A^.j \ > mC for 
all j (0 < j < 2). Now let [i be any non-invertible message-type, let 7r = tp 1 (/x), 
and let p = tp 2 (/i). (Note that n and p may be identical.) Let t be the 
bit-string of length q such that p\, = /i^t, and suppose some element a sends 



4 MAIN RESULT 



16 




Figure 3: Fixing the non-invertible message- types. 



n a t > messages labelled p. It follows that a G A n , and also that there is a 
vector u > such that g-K,t( a ) = u - and hence such that g~\{\i) is non-empty. 
By Equation (|23[) . z n ,t,u > 0, whence, by the constraints (|18fl . 5Z{y P ,e.u' I u ' — 
C} > 3mC. But recall that, since p is a 1-type, |^4 p | = ^2{y p ,e,n' I u ' — C}, so 
that each of the sets A p ^, A Pi \ and A Pi 2 contains at least mC elements. Since 
a G An, let j (0 < j < 2) be such that a G A^j, let fc = j + 1 (mod 3), and 
select n a> t elements b from A p ^ which have not yet been chosen to receive any 
other messages (invertible or non-invertible) sent by a. Since the total number 
of messages sent by a is certainly at most mC, we never run out of choices. For 
each of these elements b, set tp a [a, b] = p. Since ir = tp 1 (^t) and p = tp 2 (/i), 
these assignments cannot clash with those made in Step 1, and by construction, 
they cannot clash with assignments corresponding to other messages sent by a. 
We need only check that they cannot clash with assignments corresponding to 
messages sent by b. Specifically, we must ensure that, if tp a [a, b] = p, is assigned 
as just described, it is not possible for a to be chosen to receive a //-labelled 
message sent by b, where p! is some non-invertible message-type. But any p! - 
labelled message sent by b G A Pt k, with tp 2 (//) = n, could only be sent to an 
element in A^ji, where j' = k + 1 (mod 3); and by assumption, A^j and A^ji 
are disjoint, (Fig. QJ. Observe that this conclusion follows even if tt = p. 

Step 4 (Fixing the remaining 2-types): Recall that a guard-atom is any atom 
p(x, y) or p(y, x), where p is a binary predicate. If tp a [a, b] has not been defined, 
set it to be the 2-type 

tt U p[y/x] U {-17 I 7 is a guard-atom not involving «}, 

where tt — tp a [a], p = tp a [6], and p[y/x] is the result of replacing x by y in p. 
Note that neither this 2-type nor its inverse is a message- type. Note also that, 
since a and b certainly send some messages, the constraints (|16fl and l|17ll ensure 
that both a A /\ vr and a A f\ p are satisfiable. 

This completes the definition of 21; it remains to show that 21 |= (j). Referring 
to we consider first the conjuncts: 

Vxa A f\ Vx\/y(e h (x,y) -» (J3 h V x w y)). 

l<h<l 

We see from the constraints H16|) and (|17|) that no 2-type assignment in Steps 2 
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and 3 violates these conjuncts. And it is obvious that no assignment in Step 4 
does so. (This is where we use the guardedness of (j>, of course.) Finally, we 
consider the conjuncts 

f\ Vx3 =Ci y(fi(x,y) Ax 56 y). 

l<2<m 

To see that these conjuncts are all satisfied, it suffices to note Equation Q29|l 
and the fact that none of the 2-types assigned in Step 4 is a message-type. We 
remark in passing that 21 is chromatic. □ 

The constraints £ all have the forms 

X\ + ••• + !„ = X 

xi H h x n > 1 , . 

x = [M) 

x > x\ H hi„ > D, 

where n > 0, x, x±, . . . , x n are variables, and D is a positive integer. We measure 
the size \\£\\ of £ in the usual way, with binary encoding of the constants D. 

The following lemma essentially repeats Lutz, Sattler and Tendera [S], Propo- 
sition 11. (Those authors in turn credit Calvanese [3].) We repeat the proof for 
convenience. 

Lemma 8. Let <p and £ be as above. An algorithm exists to determine, in time 
bounded by an exponential function of \\4>\\, whether £ has a solution over N. 

Proof. Suppose £ has a solution 9 : V — > N. We define the integer programming 
problem £g by replacing every constraint in £ having the form x > x\ + 
• • • + x n > D with one of two corresponding inequalities as follows: 

x = if 6{x) = 

X\ + • • • + x n > D otherwise. 

It is easy to check that 9 is a solution of £g, and, moreover, that every solution 
of £g is a solution of £. Since £g is an integer programming problem and has a 
solution over N, by a well-known theorem (Papadimitriou |1U|1. it has a solution 
9' over N in which every value is bounded by a (positive) integer H, where H 
can be computed (as a binary string) in time bounded by a polynomial function 
of \\£e\\ and hence in time bounded by an exponential function of \\4>\\. (Of 
course, the integer H is bounded only by an doubly exponential function of 
\\4>\\.) Moreover, 9' must also be a solution of £. Therefore £ too has a solution 
over N if and only if it has a solution over N in which every value is bounded 
by H. 

Now consider the integer programming problem £h defined by replacing 
every constraint of the form x > xi + - ■ -+x n > D in £ by the corresponding 
inequalities 

Hy > x 
x\ H Yx n > Dy, 
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where y is a new variable. Every solution of £h over N is a solution of £. 
Moreover, suppose 9' is any solution of £ over N in which all values are bounded 
by H; and let y be one of the new variables of £h, introduced to eliminate the 
constraint x > => x\ + ■ ■ ■ + x n > D. Let us extend 0' to give a value to y as 
follows: 

'o if0'(x) = O 
1 otherwise. 



m = 



It is routine to check that extending 9' in this way for all the new variables y in 
£h yields a solution of £jj. Hence £ can be transformed, in time bounded by 
an exponential function of ||<^||, into the equisatisfiable (over N) constraint set 
£ h , in which all constraints are of the forms 

x\ + • • • + x n = x x = 

X\ + ■ • • + x n > 1 Dx\ > xi 

X\ + ■ • ■ + x n > Dx, 

where, again, the D are positive integers. It is obvious that, if £h has a solution 
over the non-negative rationals, then it has a solution over N as well. (Simply 
multiply by the product of all the denominators.) Hence, we can equivalently 
regard £r as a linear programming problem. But linear programming is in 
PTIME, by Khachiyan's theorem [7j. □ 

Theorem 1. The finite satisfiability problem for QC 2 is in EXPTIME. 

Proof. Lemmas H El □ and El □ 



5 The Satisfiability Problem 

The above technique also provides a simple proof of a result derived in Kaza- 
kov 0, namely, that the satisfiability problem for QC 2 is in EXPTIME. 

Notation 2. Let N* denote the set NU{H }. We extend the ordering > and the 
arithmetic operations + and • from N to N* in the obvious way. Specifically, we 
define K > n for all n G N; we define K + K = Ho-^o = and 0-H = H o -0 = 0; 
we define n + Ko = Ho + n = Ho for all n £ N; and we define n ■ Ho = Ho • n = Ho 
for all n £ N such that n > 0. Under this extension, > remains a total order, 
and +, • remain associative and commutative. 

Consider again the constraints £ given in ijljl— l|18|l. but now with the variables 
ranging over the whole of N*. Using the arithmetic in Notation^ the reasoning 
of Lemmas EH3 works unproblcmatically even when countably infinite sets are 
allowed. Thus, we have: 

Lemma 9. Let <j> and £ be as above. Then <p is satisfiable if and only if £ has 
a solution over N* . 

Proof. If 4> is satisfiable, then it has a model which is finite or countably infinite. 
Now proceed as for Lemma |SJ For the converse, proceed as for Lemma O 
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Lemma 10. The set of constraints £ has a solution over W if and only if it 
has a solution over {0, Ho}. 

Proof. Suppose Z has a solution 6 : V — > N*. By considering the forms in £, 
we see that 6' : V — > {0, H } defined by = Ho6*'(w) is also a solution. The 
other direction is trivial. □ 

Since the domain {0, H } has only 2-elements, variables interpreted over it 
are essentially Boolean. If x € V, let us write X for the corresponding statement 
x = 0, so that the constraints £ are viewed as formulas of propositional logic. 
For example, a constraint of the form 

x\ -I V x„ = x 

becomes the set of Boolean formulas 

{Xi A • ■ • A X n -> X} U {X ->• Xi | 1 < i < n}- 

a constraint of the form 

xi H + x n > 1 

becomes the Boolean formula 

Xi A 

and a constraint of the form 

x > => 

becomes the Boolean formula 

Xi A • • • A X n — > X. 

A quick check reveals that all of the resulting formulas are Horn-clauses. 
This immediately yields: 

Theorem 2 (Kazakov). The satisfiability problem for QC 2 is in EXPTIME. 

The proof in Kazakov [H] proceeds by showing that satisfiability in QC 2 can be 
reduced in polynomial time to satisfiability in the 3- variable guarded fragment; 
Theorem then follows by the complexity bound for the latter established by 
Gradel The approach taken here is thus somewhat more direct. Moreover, 
Kazakov's reduction is not conservative, and, as mentioned, yields no complexity 
bound for the corresponding finite satisfiability problem. 
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